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The present invention provides mutants of the Green Fluorescent Protein (GFP) of Aequorea victoria. Specifically provided by the 
present invention are nucleic acid molecules encoding mutant GFPs, the mutant GFPs encoded by these nucleic acid molecules, vectors 
and host cells comprising these nucleic acid molecules, and kits comprising one or more of the above as components. The invention also 
provides methods for producing these mutant GFPs. The fluorescence of these mutants is observable using fluorescein optics, making the 
mutant proteins of the present invention available for use in techniques such as fluorescence microscopy and flow cytometry using standard 
FITC filter sets. In addition, certain of these mutant proteins fluoresce when illuminated by white light, particularly when expressed at 
high levels in prokaryotic or eukaryotic host cells or when present in solution or in purified forni at high concentrations. The mutant GFP 
sequences and peptides of the present invention are useful in the detection of transfection, in fluorescent labeling of proteins, in construction 
of fusion proteins allowing examination of intracellular protein expression, biochemistry and trafficking, and in other applications requiring 
the use of reporter genes. 
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Mutants of Green Fluorescent Protein 



BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention is in the fields of molecular and cellular biology. More 
particularly, the invention is directed to mutants of the genes encoding Green 
Fluorescent Protein (GFP) and the proteins encoded by these mutants. The 
mutant GFPs are used to allow detection of eukaryotic and prokaryotic cells 
transfected or transformed with extrinsic genes, and to label proteins of interest 
to facilitate their localization within viable cells. 

RelaiedArt 

Transfection of Foreign Genes 

To study the function of a gene, a technique that is commonly employed 
is the transfer of the gene into a new cellular environment. This process, called 
"transfection," provides several advantages to the genetic scientist. For example, 
the cellular protein encoded by the gene can often be more easily studied by 
transferring the gene into a cell or organism that normally does not produce the 
protein, and then examining the effect of this protein on the host cell. The 
existence and fiinction of regulatory genetic sequences (e.g., promoters, inhibitors 
and enhancers) may be elucidated by transfection of foreign genes into cells 
containing the regulatory sequences. The transfer of non-native or altered genes 
into a host cell also allows for large-scale production of the proteins encoded by 
the genes, a process upon which much of the current biotechnology industry is 
based. Transfection of plant ^bryos with foreign genes has provided genetically 
engineered plants that are more resistant to adverse environmental conditions or 
that are more nutritionally rich. Finally, gene transfer methods allow the 
introduction of new or mutated genes into whole organisms. This latter capability 
provides the opportunity for the construction of stable models of mammalian 



diseases, for large-scale production of proteins in the milk of transgenic lactating 
animals, and for the possibility of genetic therapy for certain diseases. 

A variety of techniques has been used to transfect non*native genes into 
cells (reviewed in Sambrook, J., et al.^ Molecular Cloning, a Laboratory Manual, 
2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press, pp. 
16.30-16.55 (1989); Watson, J.D., etoL, Recombinanl DMA, 2nd Ed, New York: 
W.H. Freeman and Co., pp. 213-234 (1992)). These techniques include biological 
methods such as the use of viruses (e.g., adenovirus or certain retroviruses for 
mammalian cells, baculovirus for insect cells and bacteriophages for bacterial cells) 
or bacteria (e.g, Agrobacterium for plant cells), chemical methods such as 
calcium phosphate precipitation, DEAE-dextran-mediated endocytosis or 
liposome-mediated transfection, and physical methods such as electroporation or 
direct microinjection. For transfection of mammalian cells, the techniques most 
commonly employed currently are virus-mediated transfection, lipofection and 
electroporation. 

Detection of Gene Transfer 

Regardless of the method used, however, simply attempting to transfect 
a cell does not guarantee that a majority (or even any) of the target cells vwll take 
up and/or express the exogenous DNA. Indeed, it has been suggested that the 
success rate of even the most optimal techniques used for transfection results in 
stable transfer of exogenous DNA is far less than 1% (Watson, J.D,, et al. 
Recombinant DNA, 2nd Ed, New York: W.H. Freeman and Co., pp. 216, 218 
(1992)). Thus, it is usually critical to determme which target cells have received 
and/or incorporated the gene(s) being transfected, for which a number of 
methodologies have been used. 



Expression 

The most obvious of these methods is to simply examine the target cells 
for expression of the exogenous gene. In this method, the transfected cells are 
grown in vitro and assayed for the presence of the protein encoded by the 
transferred gene. These assays are usually accomplished using immunological 
techniques such as Western blotting, ELISA or RIA. This type of technique is 
only useful, however, if the protein is produced in relatively high amounts 
(generally at the microgram levd or above) and if suitable antibodies are available, 
neither of which is the case for some transfected genes. 

In those cases where protein expression cannot be examined, incorporation 
of exogenous genes can be determined by assaying the target cells for production 
of the mRNAs corresponding to the transferred genes. One very common 
technique for this determination is Northern blotting (AJwine, J.C., et aL, Proc. 
Natl Acad Sci, USA 7^:5350-5354, 1977), in which RN A molecules are isolated 
from cells, separated by gel electrophoresis and electroblotted onto a solid support 
(e.g., nitrocellulose or nylon). The solid support is then overlaid with 
radiolabelled cDNAs corresponding to the transfected gene, which hybridize on 
the solid support to their complementary mRNAs. After exposing the blot to 
photographic film, the samples containing the expressed transgene are easily 
determined. While this method is more sensitive than those directly measuring 
protein expression. Northern blotting still relies on actual expression of the gene 
by the target cells, which is not always the case. 

SelecHon 

Another method for determining gene transfer, alternative to directly 
measuring gene expression, is to examine the effect of the gene on the transfected 
cells. For example, some transfected genes will confer upon their host ceils the 
ability to grow in selective culture media or under some other environmental stress 
which non-transfected cells cannot tolerate. Genes of interest are often engineered 



into sequences conferring, for example, antibiotic resistance upon the recipient 
cells. Transfectants with these constructs will thus carry not only the gene of 
interest but also the antibiotic resistance gene which allows them to grow in 
antibiotic-containing media. Since non-transfected cells will not possess this 
resistance, any cell able to grow in media containing antibiotic will contain the 
resistance marker (the so-called "selectable marker") cmd the transgene that is 
linked to it. Selectable markers commonly used in such an approach are the 
neomycin {neo\ ampicillin {amp) and hygromycin {hyg) resistance genes. 

In the same way, selectable markers conferring on the transfected cells a 
metabolic advantage (e.g., ability to grow in nutrient-deficient media) have been 
used successfully. Examples of these types of selectable markers include 
thymidine kinase (Bacchetti, S., and Graham, F.L., Proc. Natl Acad ScL USA 
7^:1590-1594 (1977); Wigler, M., et al. Cell 77:223-232 (1977)) and xanthine- 
guanine phosphoribosyltransferase (Mulligan, R.C., and Berg, P., Proc, Nail 
Acad ScL USA 75:2072-2076 (1981)), which impart to their recipients the ability 
to grow, using metabolic rescue pathways encoded by the marker genes, in media 
that inhibit vital metabolic pathways m non-transfected cells. Again, any cells able 
to grow in such media will contain the transgene linked to the marker gene. 

Selection methods such as these often require weeks of culturing of the 
cells, continuously undo- selective pressure, to provide a relatively pure population 
of stable transfectants. Many uses of transfected cells, however, are conducted 
within hours of transfection, far too soon to determine transfection success usmg 
either the expression or selection methods described above. These types of 
applications are fadlitated by a third approach - the use of "reporter genes". 

Reporter Genes 

Reporter genes are analogous to selectable markers in that they are co- 
transfected into recipient cells with the gene of interest, and provide a means by 
which transfection success may be determined. Unlike selectable markers. 
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however, reporter genes typically do not confer any particular advantage to the 
recipient cell. Instead reporter genes, as their name implies, indicate to the 
observer (via some phenotypic activity) vAich cells have incorporated the reporter 
gene and thus the gene of interest to which it is linked, A number of reporter 
genes have been used, including those operating by biochemical or fluorescent 
mechanisms, each with its own advantages and limitations. 

Biochemical Reporter Genes 

Some commonly used reporter genes encode enzymes or other 
biochemical markers which, when active in the transfected cells, cause some 
visible change in the cells or their environment upon addition of the appropriate 
substrate. Two examples of this type of reporter sequence are the E. coli genes 
lacZ (encoding P-galactosidase or "P-gal") and gusA or iudA (encoding P- 
glucuronidase or "P-glu"); the former is often used as a reporter gene in animal 
cells (Hall, C.V., et al, 1 Mol. Appl Genet 2:101-109 (1983); Cui, C, et al, 
TrangemcRes. 3:182-194 (1994)), the latter in plant ceUs (Jefferson, R.A., Nature 
3^2:837-838 (1989);Watson, J.D., etaL, Recombinant DNA, 2nd Ed, New York: 
W.H. Freeman and Co., pp. 281-282 (1992); Hull, G.A., and Devic, M., Meth. 
MoL BioL ^9:125-141 (1995)). These bacterial sequences are useful as reporter 
genes because the recipient cells, prior to transfection, express extremely low 
levels fif aity) of the enzyme encoded by the reporter gene. When transfected cells 
expresang the reporter gene are incubated with an appropriate substrate (e.g., X- 
gal for p-gal or X-gluc for p-glu), a colored or fluorescent product is formed 
which can be detected and quantitated histochemically or fluorimetrically. 

Another often-used reporter gene is the bacterial gene encoding 
chloramphenicol acetyltransferase (CAT), which catalyzes the addition of acetyl 
groups to the antibiotic chloramphenicol (Gorman, CM., et al, Mol Cell 
Biol 2:1044-1051 (1982); Neumann, J,R., et al, BioTechniques 5:444-446 
(1987); Eastman, A., BioTechniques 5:130-732 (1987); Feigner, P.L., et aL, Ann. 
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KY. Acad ScL 772:126-139 (1995)). After transfection, recipient cells are lysed 
and the lysates are incubated with radiolabelled chloramphenicol and an acetyl 
donor such as acetyl-CoA, or with unlabeled chloramphenicol and radiolabeled 
acetyl-CoA (Sleigh, M. J., .4wa/.5/oc//ew. 756:251-256(1986)). If expressed in 
the cells, CAT transfers acetyl groups to chloramphenicol, which is then easily 
assayed by chromatographic techniques, thereby giving an indication of the 
incorporation of the co-transfected gene of interest by the recipient cells. 

Using reporter genes in this way, populations of cells, or even single cells, 
can be rapidly assayed for their incorporation of the exogenous gene linked to the 
reporter gene. Since they do not rely directly on the expression of the gene of 
interest, assays of transfection success using reporter genes are usually simpler and 
more sensitive than those measuring mRNA or protein production from the 
transgene (Watson, J.D., et ai. Recombinant DNA, 2nd Ed, New York: W.H. 
Freeman and Co., p. 155 (1992)). However, the use of reporter genes is severely 
limited in that it usually requires sacrifice (fixation) of the cells prior to assay, and 
therefore cannot be used for assaying living cells or cultures. Thus, alternative 
means for determining the incorporation of the transgene in viable cells have been 
developed. 

Fluorescent Reporter Genes 

An example of viable reporter genes that are rapidly gaining widespread 
use are those that are fluorescence-based. These genes encode proteins which are 
either naturally fluorescent or which convert a substrate from nonfluorescent to 
fluorescent. Assays using this type of reporter gene are non-destructive and, 
owing to the availability of sophisticated fluorescence detection systems, are often 
more sensitive than biochemical reporter gene assays. 

One example of a fluorescence reporter gene is the luciferin-luciferase 
system (Bronstein, I., etal.Anal Biochem, 279:169-181 (1994)). This system 
utilizes the gene for luciferase, an ATPase enzyme isolated from fireflies (Gould, 
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S J., and Subramani, S., Anal. Biochem. 775:5-13 (1988)) and other beetles 
(Wood, K.V., eiaL, J, Biolumin Chemiltmin -/:289'301 (1989)), or from certain 
bioluminescent bacteria (Stewart, G.S., and Williams, P., J. Gen. Microbiol 
73«:1289-1300 (1992); Langridge, W., etal., J. Biolumin, Chemilumin. 9:185- 
200 (1994)). For use as a reporter gene, the luciferase gene is placed into a vector 
also containing the gene of interest, or separate vectors containing the luciferase 
gene and the gene of interest are mixed together. Cells are then transfected with 
the vector(s) and treated wth the luciferase substrate luciferin which is rendered 
luminescent (and impermeant) intracellularly by the action of the luciferase. Cells 
containing the luciferase gene, and thus the gene of interest linked to it, can then 
be rapidly and sensitively observed using luminescence detectors such as 
luminometers. 

To provide a further increase in sensitivity, attempts have been made to 
use genes from certain cyanobacteria which encode naturally fluorescent 
phycobiliproteins such as phycoerythrin and phycocyanin. These proteins are 
among the most highly fluorescent known (Oi, V.T., etal^ J, Cell Biol Pi:981- 
986 (1982)), and systems have been developed that are able to detect the 
fluorescence emitted from as little as one phycobiliprotein molecule (Peck, K., et 
al, Proc. Natl Acad ScL USA «6:4087-4091 (1989)). PhycobUiproteins also 
have the advantage of being naturally fluorescent, thus eliminating the time- 
consuming steps of the addition of exogenous substrates for their detection as is 
required for luciferase and biochemical reporter genes. However, the 
phycobiliproteins have provai extremely difficult to engineer into gene constructs 
in such a way as to maintain their fluorescence (Hdm, R., et al. , Proc. Natl Acad 
ScL USA 97:12501-12504 (1994)), and thus are not commonly used as reporter 
genes in assaying the transfection of mammalian cells. 

Thus, the ideal reporter gene would encode a naturally fluorescent protein 
(for ease of use following transfection) that is highly fluorescent (for increased 
sensitivity) and easily engineered (for maintenance of fluorescence). Such a 
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system has recently been developed, using the Green Fluorescent Proteins (GFPs) 
isolated from certain marine cnidarians. 

GFP 

Cfvemew 

5 GFPs are involved in bioluminescence in a variety of marine invertebrates, 

including jellyfish such as Aequorea spp. (Morise, H., et al. Biochemistry 
75:2656-2662 (1974); Prendergast, RG, and Mann, K.G, Biochemistry 77:3448- 
3453 (1978); Ward, W.W, Phoiochem, PhotobioL Rev. 4:U51 (1979) and the sea 
pansy Renilkt remformis (Ward, W.W., and Cormier, M.J., Phoiochem. 

10 PhotobioL 27:389-396 (1978); Ward, W.W., ei aL, Phoiochem, PhotobioL 

57:61 1-615 (1980)). The GFP isolated from Aequorea victoria has been cloned 
and the primary amino acid structure has been deduced (Figure 1 ; Prasher, D.C., 
et aL, Gem J J 1:229-233 (1992)) (SEQ ID NOs:l, 2). The chromophore of A, 
victoria GFP is a hexapeptide composed of amino acid residues 64-69 in which 

15 the amino acids at positions 64-67 (serine, tyrosine and glycine) form a 

heterocyclic ring (Prasher, D.C, etaL, Gene 777:229-233 (1992); Cody, C.W., 
etoL, Biochemistry 32:1212-1218 (1993)). Resolution of the crystal structure of 
GFP has shown that the chromophore is contained in a central a-helical region 
surrounded by an 1 1-stranded P-barrel (Ormo, M., et al.. Science 273: 1392-1395 

20 (1996); Yang, F., et aL, Nature Biotech 7^:1246-1251 (1996)). Upon 

purification, native GFP demonstrates an absorption maximum at 395 nanometers 
(nm) and an emission maximum at 509 nm (Morise, H., et al.. Biochemistry 
73:2656-2662 (1974);Ward, W.W., et aL, Photochem. PhotobioL 37:611-615 
(1980)) with exceptionally stable and virtually non-photobleaching fluorescence 

25 (Chalfie, M., et aL, Science 263:802-805 (1994)), 

While GFP has been used as a fluorescent label in protem localization and 
conformation studies (Heim, R., et aL, Proc, NatL Acad. ScL USA P7: 1250-1254 
(1994); Yokoe, H, and Meyer, T., Nature Biotech. 7^: 1252-1256 (1996)), it has 



gained increased attention in the field of molecular genetics since the 
demonstration of its utility as a reporter gene in transfected prokaryotic and 
eukaiyotic cells (Chalfie, M, etaL, Science 2^3:802-805 (1994); Heim, R., ei al, 
Proc. NatL Acad. ScL USA 97:1250-1254 (1994); Wang, S., and Hazelrigg, T., 
Nature itfP:400-403 (1994)). GFP has also been used in fluorescence resonance 
energy transfer studies of protdn-protein interactions (Heim, R., and Tsien, R, Y., 
Curr BioL 6:178-182 (1996)). Since GFP is naturally fluorescent, exogenous 
substrates and cofactors are not necessary for induction of fluorescence, thus 
providing GFP an advantage over the biochemical, luminescent and other 
fluorescent reporter genes described above. Visualization of GFP fluorescence 
does not require the fixation steps necessary with biochemical reporters such as 
P-gal and P-glu, nor does it require extraction fi-om the cell prior to assay as may 
be required with luciferase; thus, GFP is suitable for use in procedures requiring 
continued viability of transfected cells. In addition, since the GFP cDNA 
containing the complete coding region is less than 1 kilobase in size (Prasher, 
D.C., etal. Gene 777:229-233 (1992)), it is easily manipulated and inserted into 
a variety of vectors for use in creating stable transfectants (Chalfie, M., et al. 
Science 255:802-805 (1994)). 

Despite these advantages, however, the use of wUdtype GFP has a few 
limitations. For example, the exdtation and emission maxima of wildtype GFP are 
not within the range of wavelaigths of standard fluorescence optics (at which GFP 
demonstrates relatively low quantum yield (/.c., low intensity of fluorescence)). 
In addition, GFP shows low efficiency of transcription in mammalian cells upon 
transfecdon and is packaged into low-solubility inclusion bodies in bacteria (thus 
providing difficulty in purification). These limitations have been overcome to a 
limited extent via the mtroduction of selected point mutations into the sequence 
of wildtype GFP. 
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GFP Mutants 

One of the earliest mutation studies of GFP, in which the tyrosine residue 
at position 66 in the wildtype protein ("wt-GFP") was replaced with a histidine 
residue, resulted in a mutant protein which fluoresced blue instead of green when 
excited with ultraviolet (UV) light (Heim, R., et aL, Proc, NaiL Acad. ScL USA 
P/: 1250- 1254 (1994)). This mutant protein not only provided a capacity for two 
distinguishable wavelengths for use in studies comparing independent proteins and 
gene expression events, but also demonstrated that single point mutations in GFP 
could induce drastic changes in the photochemistry of the protein. Three other 
sets of specific point mutations have been shown to increase the excitation and 
emission maxima of GFP such that they fall well within the range of standard 
fluorescein optics (Ehrig, T., eiaL, FEBS Letts, 367:163-166 (1995); Delagrave, 
S, etaL, Bio/Technology 75:151-154 (1995); Heim, R., and Tsien, R., Curr. Biol 
6:178-182 (1996)), thus permitting the use of GFP with standard laboratory 
fluorescence detection systems. The problem of low quantum yield by wt-GFP 
has been partially addressed by mutating the serine residue at position 65 to a 
threonine ("S65T"), either without (Heim, R., et al, Proc, Natl Acad ScL USA 
97:1250-1254 (1994)) or with (Cormack, B., etal. Gene 77i:33-38 (1996)) a 
concomitant mutation at position 64, or by mutating other residues in the non- 
chromophore region (Crameri, A., et al. Nature Biotech. 7-^:315-319 (1996)). 
The S65T mutation also appears to improve the rate of fluorophore formation in 
transfected cells by approximately four-fold over wt-GFP, thus allowing earlier 
and more sensitive detection of transfection with this mutant than with wt-GFP 
(Heim, R., et al, Proc, Nail Acad Sci. USA P7:1250-1254 (1994)). By 
combining the S65T mutation with a mutation at position 64 replacing 
phenylalanine with leucine, approximately 90% of the mutant GFP expressed in 
bacteria is soluble, thus improving protein purification and yields (Cormack, B., 
et al. Gene 775:33-38 (1996)). Another series of mutations results in a mutant 
fusion GFP consisting of linked blue- and green-fluorescing proteins which have 
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proven useful in studies of protein localization, targeting and processing (Heim, 
R., and Tsien. R.Y,, Curr. Biol (5:178-182 (1996)). Analogously, chimeric 
constructs comprising GFP linked to other proteins have been used in studies of 
ion channel expression and fiinction (Marshall, J., et al,. Neuron 7^:211-215 
(1995)), and in organelle targeting studies where they have provided a means for 
selectively and distinctively labeling the organelles of living cells (Rizzuto et ai, 
Curr. Biol. 6:183-188 (1996)). Finally, by combining the S65T mutation with 
other mutations throughout the nonchromophore regions of the wt-GFP gene, a 
"humanized" mutant GFP (SEQ ID N0s:3, 4) has been produced that not only 
shows a significant increase in fluorescence intensity and rate of fluorophore 
formation over wt-GFP (via the S65T mutation) but also demonstrates a 22-fold 
increased expression efficiency in mammalian cells (Evans, K., et al, FOCUS 
18(2)AQ^2 (1996); Zolotukhin, S., et ai, 1 Virol 70:4646-4654 (1996)). This 
humanization was achieved via 92 base substitutions (in 88 codons) to the wt-GFP 
gene which were amino acid-conservative and which were made to provide a 
pattern of codon usage more closely resembling that of manmialian cells, as 
opposed to the jellyfish codon patterns found in the wt-GFP gene which are less 
eflSciently translated in mammalian cells. A summary of these GFP chromophore 
mutants is presented in Table 1. 
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Table 1. GFP Chromophore Mutants. 





j Amino Acid Residue Number: 




Mutant 


64 


65 


66 


Reference^ 


(Wildtype) 


Phe 


Ser 


Tyr 


Prasher ei 
al, 1992 


GreenLantern-l 


Phe 


Thr 


Tyr 


Evans et aL, 
1996 


riumanizeo orr 


rJie 


Thr 


Tyr 


Zolotukhin 
etaL, 1996 


Y66H 


Phe 


Ser 


His 


Heim etaL, 
1994 


Y66W 


Phe 


Ser 


Trp 


Y66F 


Phe 


Ser 


Phe 


RSGFPl 


Gly 


Ser 


Tyr 


Delagrave ei 
ai, 1995 


RSGFP2 


Leu 


Leu 


Tyr 


RSGFP3 


Gly 


Cys 


Tyr 


RSGFP4 


Met 


Gly 


Tyr 




vai 


Ala 
/\la 


lyr 


RSGFP7 


Leu 


Cys 


Tyr 


S65A 


Phe 


Ala 


Tyr 


Heim etai, 
1996 


S65L 


Phe 


Leu 


Tyr 


S65C 


Phe 


Cys 


Tyr 


S65T 


Phe 


Thr 


Tyr 


GFPmutl 


Leu 


Thr 


Tyr 


Cormack et 
aL 1996 



20 ^ See preceding text for MI citations. 



Despite some success in overcoming certain of the above-described 
limitations of GFPs, the sensitivity of GFP as a reporter gene (measured as 
percaitage of positive cells) is not as high as that of standard biochemical reporter 
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genes such as p-gal (Evans, K., et aL, FOCUS J8(2) A0'A3 (1996)). In addition, 
the use of GFP as a reporter gene or a protein tag requires the use of fluorescent 
excitation and emission optics, which increases user expense and which is more 
technically challenging than the use of visible or white light optics often used with 
standard reporters such as p-gal. Thus, a need currently exists for additional GFP 
variants which are more highly fluorescent, humanized, rapidly expressed in 
mammalian ceUs, capable of bdng observed using standard white light optics, and 
which provide an increased level of sensitivity. 



SUMMARY OF THE INVENTION 



It is thus an object of the present invention to provide mutant GFP cDNAs 
and proteins. In one aspect, the invention relates to such mutant GFP cDNAs 
which, when transfected into prokaryotic (e.g., bacterial) or eukaryotic (e.g,^ 
mammalian) cells, increase the sensitivity of detection (measured as percentage or 
number of positive cells). The present invention thus provides nucleic acid 
molecules encoding mutant GFPs, wherein the mutant GFPs have an amino acid 
sequence comprising an amino acid residue lacking an aromatic ring structure at 
position 64 and an amino acid residue ha\^ng a side chain no longer than two 
caibon atoms in length at poation 65. Preferably, (a) if the residue at position 64 
is leucine then the residue at position 65 is not cysteine or threonine; (b) if the 
residue at position 64 is valine then the residue at position 65 is not alanine; (c) if 
the residue at position 64 is methionine then the residue at position 65 is not 
glycine; and (d) if the residue at position 64 is glycine then the residue at position 
65 is not cysteine. The invention is particularly directed to such nucleic acid 
molecules encoding mutant GFPs wherein the amino acid residue at position 64 
is alanine, valine, leucine, isoleucine, proline, methionine, glycine, serine, 
threonine, cysteine, alanine, asparagine, glutamine, aspartic acid or glutamic acid, 
most preferably cysteine or methionine. The invention is also particularly directed 
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to such nucleic acid molecules encoding mutant GFPs wherein the amino acid 
residue at position 65 is alanine, glycine, threonine, cysteine, asparagine or 
aspartic acid, most preferably alanine. In particular, the invention provides 
nucleic add molecules encoding mutant GFPs wherein the amino acid at position 
64 is cysteine or methionine and the amino acid at position 65 is alanine, and 
nucleic acid molecules encoding mutant GFPs having an amino acid sequence as 
set forth in either SEQ ID N0:5 or SEQ ID N0:6. 

In additional aspects, the invention provides mutant GFPs encoded by any 
of the above-described nucleic acid molecules, vectors (particularly expression 
vectors) comprising these nucleic acid molecules, host cells (prokaryotic or 
eukaryotic (including mammalian)) comprising these nucleic acid molecules or 
vectors, and compositions comprising plasmid pGreenLantem-2/Al or plasmid 
pGreenLantem-2/A4. The invention also provides methods for producing a 
mutant GFP, comprising culturing the above-described host cells under conditions 
favoring the production of a mutant GFP and isolating the mutant GFP from the 
host cell. The invention also provides mutant GFPs produced by these methods, 
particulariy wherein the mutant GFPs emit fluorescent light when iUuminated with 
white light. The invention also relates to compositions comprising the above- 
described mutant GFPs. 

The invention is farther directed to kits for transfecting a host cell with the 
nucleic acid molecules encoding the present mutant GFPs, such kits comprising 
at least one container containing a nucleic acid molecule encoding a mutant GFP 
such as those described above, which preferably comprises plasmid 
pGreenLantem-2/Al or plasmid pGreenLantem-2/A4. These kits of the invention 
may optionally further comprise at least one additional container containing a 
reagent, preferably comprising a liposome and most preferably 
LIPOFECTAMINE™, for delivering a mutant GFP nucleic acid molecule into a 
host cell. 
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The invention is fuith^ directed to kits for labeling a polypeptide with the 
present mutant GFPs, such kits comprising at least one container containing a 
mutant GFP such as those described above, preferably a mutant GFP having an 
amino acid sequence as set forth in SEQ ID N0:5 or SEQ ID N0:6. These kits 
5 of the invention may optionally further comprise at least one additional container 

containing a reagent for covalently linking this mutant GFP to the target 
polypeptide. 

The fluorescence of all of the GFP mutants provided by the present 
invention is observable with fluorescein optics, making these mutant proteins 

10 amenable to use in techniques such as fluorescence microscopy and flow 

cytometry using standard FITC filter sets. In addition, the fluorescence of certain 
of the present GFP mutants, particularly those having amino acid sequences as set 
forth in SEQ ID NOs: 5 and 6, is visible using standard white light optics (e.g., 
incandescent or fluorescent indoor lighting, or sunlight). The nucleic acid 

15 molecules and mutant GFPs provided by the present invention thus contribute 

improved tools for detection of transfection, for fluorescent labeling of proteins, 
for construction of fiision proteins allowing examination of intracellular protein 
expression, biochemistry and traflBcking, and for other applications requiring the 
use of reporter genes. 

20 Other preferred embodiments of the present invention will be apparent to 

one of ordmaiy skiU in light of the following drawings and description of the 
invention, and of the claims. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a depiction of the nucleotide (SEQ ID NO: 1) and deduced 
25 amino acid (SEQ ID N0:2) sequences of A. victoria Green Fluorescent Protein 

cDNA (after Prasher, D.Q, etal.. Gene 777:229-233 (1992)). 
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Figure 2 is a depiction of the nucleotide (SEQ ID N0:3) and deduced 
amino acid (SEQ ID N0:4) sequences of humanized A, victoria Green 
Fluorescent Protein cDNA (after Zolotukhin, S., et aL, J. Virol. 70:4646-4654 
(1996)). 

Figure 3 is a depiction of the amino acid sequence (SEQ ID N0:5) of the 
Al GFP mutant. 

Figure 4 is a dq)iction of the amino acid sequence (SEQ ID N0:6) of the 
A4 GFP mutant. 

Figure 5 is a structural map of plasmid pGreenLantem-l, 

Figure 6 is a structural map of plasmid pGreenLantem-2. 

Figure 7 is a fluorescence photomicrograph of CHO-Kl cells viewed 24 
hours after transfection wth the Al GFP mutant (plasmid pGreenLantem-2/Al). 

Figure 8 is a fluorescence photomicrograph of CHO-Kl cells viewed 24 
hours afta- transfection with the A4 GFP mutant (plasmid pGreenLantem-2/A4). 

Figure 9 is a fluorescence photomicrograph of negative control CHO-Kl 
cells viewed 24 hours after transfection with the pGreenLantem-2 backbone. 

Figure 10 is a bar graph demonstrating the fluorescence of CHO-Kl cells 
deterniined by flow cytometry 24 hours after transfection with various GFP 
mutants. 
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Figure 11 is a bar graph demonstrating the fluorescence of CHO-Kl cells 
determined by flow cytometry 48 hours after transfection with various GFP 
mutants. 

5 Figure 12 is a structural map of plasmid pProEX HTb. 

DETAILED DESCRIPTION OF TEffi INVENTION 

Overview 

The present invention provides nucleic acid molecules encoding mutant 
(SFPs, vectors and host cells comprising these nucleic acid molecules, the mutant 

10 GFP polypeptides, and methods for producing mutant GFPs. Although specific 

plasmids, vectors, promoters, selection methods and host cells are disclosed and 
used hwein and in the Examples, other promoters, vectors, selection methods and 
host cells, both prokaryotic and eukaryotic, are well-known to one of ordinary 
skill in the art and may be used to practice the present invention without departing 

15 from the scope of the invention or any of the embodiments thereof 

In the present mvention, GFPs with selective point mutations at amino acid 
positions 64 and 65 have been constructed and analyzed. In general, it has been 
discovered in the present invention that when the ammo acid residue at position 
64 (phenylalanine in wt-CTP) is mutated to an anuno add lacking an aromatic ring 

20 (e.^., alanine, valine, leucine, isoleucine, proline, methionine, glycine, serine, 

threonine, cysteine, asparagine, glutamine, aspartic acid, glutamic acid, lysine, 
arginine or histidine), an increase in fluorescence quantum yield is observed. 
Increased fluorescence intensity is also observed when the amino acid residue at 
position 65 (serine in wt-GFP) is mutated to an amino acid having a side chain 

25 consisting of no more than two carbon atoms (eg., alanine, glycine, threonine, 

cysteine, asparagine or aspartic acid), which induce a significant "red-shift" in 
excitation maximum fi-om ultraviolet to visible blue wavelengths and a single 
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excitation maximum instead of a dual excitation maximum as in the wildtype 
protein. Together, these general results indicate that in order to construct GFP 
mutants with a dramatic increase in fluorescence intensity from wt-GFP, either 
position 64 or position 65 should contain a reactive amino acid, although 
particular amino adds appear to be preferred at each position as described below. 
Furthermore, it has been un©q)ectedly discovered that several of the mutant GFPs 
of the present invention, unlike those previously known in the art, will emit 
fluorescence when illuminated by white light {e.g., incandescent or fluorescent 
indoor lighting, or sunlight). 

Accordingly, in the present invention, specific mutations are introduced 
into positions 64 and 65 of the wt-GFP cDNA sequence (SEQ ID N0:1). 
Alternatively, increased expression of the present mutant GFPs may be obtained 
by introducing the preferred mutations into a humanized GFP gene such as that 
described previously (SEQ ID N0:3) (Evans, K., ei al, FOCUS J8(2)A0-43 
(1996); Zolotukhin, S., eial., J, ViroL 70:4646-4654 (1996)). 

Construction of GFP Mutants 

Preparation of GFP Plasnuds 

The wt-GFP may be cloned from its natural source, Aequorea victoria, as 
described (Prasher, D.C., ei al. Gene 7/7:229-233 (1992)). More preferably, 
GFP cDNA to be mutated is contained within a plasmid construct or vector, 
preferably an expression vector, suitable for use in transfecting mammalian cells, 
such as pRAY-1 wherein the wt-GFP cDNA is under the control of the human 
cytomegalovirus (CMV) enhancer/promoter (Marshall, J., ei aL, Neuron 14:211- 
215 (1995)). Most preferably, to provide for optimum expression of the mutant 
GFPs in mammalian cells, the humanized S65T mutant GFP cDNA (Evans, K., ei 
al, FOCUS 750:40-43 (1996); Zolotukhin, S., eiaL, J, ViroL 70:4646-4654 
(1996)) under control of the CMV enhancer/promoter may be used, contained in 
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plasmid pGreenLantem-1 (Figure 5), which is available commercially from Life 
Technologies, Inc. (Rockville, Maryland). 

The above-described plasmids may be used directly for preparation of 
mutant GFP cDNAs according to the present invention. Alternatively, a stop 
codon in the 5' multiple cloning site of pGreenLantem-l may be shifted out of 
frame by oligonucleotide ligation mediods to allow the mutant GFPs of the present 
invention to be used in the construction of fusions between GFP and other 
proteins, as described below. 

Mutations to GFP cDNA 

A variety of random or site-directed mutagenic techniques may be used to 
prepare the mutant GFPs of the present invention. Appropriate methods include 
chemical mutagenesis using, for example, sodium bisulfite or hydroxylamine 
(Myers, R.M., et al. Science 229:242-247 (1985); Sikorski, R.S., and Boeke, 
J,D,M^tk EmymoL 79^:302-318 (1991)), linker insertion mutagenesis (Heffron, 
F., ei al., Proc. NatL Acad ScL USA 75:6012-6016 (1978)), deletion mutagenesis 
(Lai, C.J., and Nathans, D., J. MoL BioL 59:179-193 (1974); McKnight, S.L., and 
Kingsbury, R., Science 217:316-324 (1982)), enzyme misincorporation 
mutagenesis (Shortle, D., et aL, Proc. Natl Acad Sci. USA 79:1588-1592 
(1982)), oligonucleotide-directed mutagenesis (Hutchinson, C.A., et al, J, BioL 
Chem. 253:6551-6560 (1978); Zoller, MJ, and Smith, M., NucL Acids Res. 
70:6487-6500 (1982); Taylor, J,W., et al, NucL Acids Res. 73:8765-8785 
(1985)), and cassette mutagenesis (Lo, K.-M., et aL, Proc. NatL Acad ScL USA 
57:2285-2289 (1984); WeUs, J.A, et aL, Gene 3-/:3 15-323 (1985)). To improve 
the fidelity and effidmcy of mutagenesis, the use of the polymerase chain reaction 
(PCR) in accomplishing GFP mutagenesis by one or more of the foregoing 
methods is preferred (Higuchi, R., etaL, NucL Acids Res. 7(5:7351-7367 (1988); 
Leung, D.W., et aL, Technique 7:1 1-15 (1989); Clackson, T., and Winter, G, 
NucL Acids Res. 77:10163-10170(1989)). 
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Most preferably, mutations are made to GFP cDNA by uracil DNA 
glycosjdase (UDG) mutagenesis using PCR amplification (Nisson, P., et al^ PCR 
Metk AppL 7:120-123 (1991)). In this approach, the plasmid containing GFP 
cDNA, most preferably pGreenLantem-1 comprising humanized S65T GFP 
(Figure S), is used as the PCR template, and a sense or antisense primer consisting 
essentially of an oligonucleotide containing at least one mismatched nucleotide 
(available commercially firom Life Technologies, Inc.; Rockville, Maryland) is 
added to the reaction mixture. Amplification reaction mixtures most preferably 
contain IX PCR buffer, about 10 micromolar each of deoxyATP, deoxyTTP, 
deoxyCTP and deoxyGTP, about 25 picomoles each of sense and antisense 
primers and about 10 nanograms of template. PCR is performed by techniques 
that are routine in the art, and after at least five PCR cycles, samples of the 
reaction mixture are treated with UDG, most preferably for 30 minutes at 37°C, 
as described (Nisson, P., etaL, PCRMetk AppL 7:120-123 (1991)). 

The mutated GFP nucleic acid molecules preferably will comprise nucleic 
acid sequences encoding mutant proteins in which one or more amino acid 
residues have been mutated fi'om the wildtype amino acid sequence set forth in 
Figure 1 and SEQ ID N0:2. Such mutations may include, for example, 
substitutions, ddetions, insertions or modifications, and preferably are amino acid 
substitutions. Particularly preferred are amino acid substitutions occurring in the 
three amino acid chromophore of GFP at residues 64, 65 and 66 of the wildtype 
GFP sequence (Figure 1 and SEQ ID N0:2), wherein the phenylalanine residue 
at position 64 (Phe64), the serine residue at position 65 (Ser65), and the tyrosine 
residue at position 66 (Tyr66), are each individually, or all together, replaced by 
other amino acid residues. More preferred mutant GFPs of the invention include, 
but are not limited to, those with the following substitutions fi*om the wildtype 
GFP sequence shown in Figure 1 and SEQ ED N0:2: 

•serine 65 replaced by threonine (SereS-^Thr); 

•Phe64~^Cys and Ser65-^Ala (SEQ ID N0:5); 



•Phe64->Cys and Ser65-^Thr; 
•Phe64-^Leu and Ser65-^Thr; 
•Phe64-^Met and Ser65-^Ala (SEQ ID N0:6); 
•Phe64-^Met and Ser65-^Thr; 
•Phe64^Met, Ser65->Phe and Tyr66->Phe; 
•Phe64^Met, Ser65-^Phe and Tyr66-^Lys; 
•Phe64-^Thr and Ser65-^Cys; and 
•Phe64->^Val and Ser65->Cys 

Other suitable mutations and mutant GFP amino acid sequences may be 
determined by one of ordinary skill without undue experimentation according to 
the methods described herein and others that are known in the art. As a practical 
matter, whether a particular mutation or combination of mutations produces a 
mutant GFP that may have the above-described desirable properties (e.g., higher 
expression in mammalian cells, higher fluorescence intensity under UV or white 
light illumination) may be determined by one of ordinary skill using the mutation, 
transfection, expression and detection methods described in detail below in the 
Examples, as well as using standard techniques that are routine in the art. 

Following mutagenesis by any of the aboye-described methods, the 
resulting nucleic acid molecules encoding the mutant GFPs may be inserted into 
one or more vectors, such as those described above, which are preferably 
expression vectors. A particularly preferred vector for containing the present 
mutant GFP nucleic add molecules is p-GreenLantem-2 (Figure 6). Methods for 
produdng the mutant GFP-vector constructs will be femiliar to those of ordinary 
skill, and are provided in detail below in Example 1. 

Once they have been constructed, the vectors comprising the mutant GFP 
nucleic acid molecules may be formulated into a variety of compositions, such as 
solutions (e.g., buffer solutions) to be used in transfecting host cells. 
Alternatively, the vector constructs may be purified and stored according to 
standard techniques for handling recombinant DNA plasmid vectors (Sambrook, 
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J., et aL, Molecular Cloning, a Laboratory Manual, 2nd Ed., Cold Spring 
Harbor, NY: Cold Spring Harbor Laboratory Press, pp. 1.3-1.20 (1989)). 

More preferably, the mutant GFP-containing plasmid vectors are 
transformed into a competent host cell. Any competent host cell may be used, 
induding those of bacteria (e,g., R coli), yeast (eg., Saccharomyces spp.), insects 
{e.g., Spodopiera spp.) and mammals (e.g., CHO or BHK cells), although a 
competent strain of coli such as DHIOB (Life Technologies, Inc.; Rockville, 
Maryland) is most preferably used. Transformation of mutagenized GFP cDNAs 
into host cells may be accomplished by any technique generally used for 
introduction of exogenous DNA, including the chemical, viral, electroporation, 
lipofection and microinjection methods that are well-known in the art. Particularly 
preferred methods for transformation include electroporation and liposome- 
mediated transfection (lipofection), the latter most preferably being accomplished 
using LIPOFECTAMINE™ (Life Technologies, Inc.; Rockville, Maryland). 

After expansion of transformed cultures, mutated GFP cDNA is isolated 
from the host cells by routine methods (Sambrook, J., et ai. Molecular Cloning, 
a Laboratory Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor 
Laboratory Press, pp. 1.21-1.52 (1989)) and is subcloned into a plasmid backbone 
for use in subsequent transfections. Most preferably, this plasmid backbone is the 
pGreenLantem-2 backbone (see Figure 6) which contains a universal sequencing 
primer downstream from a CMV enhancer promoter and an Nsil site immediately 
upstream of the CMV promoter allowing excision of the promoter region, along 
with A&al, Xhol and HindOl sites in place of the 3' NoA site in pGreenLantem-1 
(Figure 4). 

Fusion sequences of GFP cDNA with nucleotide sequences encoding 
proteins of interest may be prepared by cloning the desired sequence(s) into 
pGreenLantem-2 at the 5' multiple cloning site using standard techniques. These 
fusion constructs allow the use of the mutant GFPs of the present invention as 
reporters of transfection efficiency. In addition, fusion constructs such as these 
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will allow a direct examination of the expression, biochemistry and localization of 
the fused proteins intracellularly. 

Alternatively, to examine the structure and function of regulatory 
sequences (e.g., promota^ oihancers, inhibitors) in native genes, the GFP mutant 
cDNAs may be directly transfected or inserted, using routine methods, into target 
genomic or extrachromosomal DNA sequences in host cells (Chalfie, M., ei aL, 
Science 263:802-805 (1994)). 

Transfection of Hosts With GFP Mutants 

Target cells to be transfected with cDNAs comprising mutant GFPs (either 
fused or unfiised to accessory sequences) are grown and maintained in culture 
according to routine methods. Cells may be transfected with mutant GFP cDNA 
by any method described above, although electroporation or liposome-mediated 
transfection (particulariy using LIPOFECT AMINE™) are preferred. Following 
transfection, cells are incubated for 12-48 hours, preferably 18-24 hours and most 
preferably for about 24 hours. Transfected cells may then be examined for the 
expression of mutant GFP, manifested as green intracellular fluorescence. With 
standard optical filters routinely used for examining fluorescein (typically 
excitation wavelength of about 475 nm, dichroic filter of 485 nm, emission 
wavelength of about 490 nm), this fluorescence may be examined qualitatively, for 
example by fluorescence microscopy, or quantitatively, for example by 
spectrofluorimetry or flow cytofluorimetry. In addition, transfected cells 
expressing relatively high amounts of mutant GFPs of the present invention may 
be separated fi-om non-transfected cells, or from those expressing lower levels of 
GFP, by fluorescence-based single cell separation tediniques such as fluorescence- 
activated cell sorting. Alternatively, transfected cells expressing mutant GFPs that 
fluoresce under white light illumination, particulariy those having amino acid 
sequences as set forth in SEQ ID NOs: 5 and 6, may be examined by the above- 
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described qualitative and quantitative methods using standard white light optics 
(e.g., incandescent or halogen lighting, or sunlight). 

These transfected host cells may also be used in methods for the 
production of mutant GFPs of the invention. Such methods may comprise, for 
example, culturing the above-described host cells under conditions favoring the 
production of the mutant GFPs by the host cells, and isolating the mutant GFPs 
fiom the host cells and/or the culture medium in which the host cells are cultured. 
Typical host cell culture conditions favoring production of recombinant proteins, 
such as the present mutant GFPs, are well-known in the art (see, e.g., Sambrook, 
J., et al. Molecular Cloning, a Laboratory Manual, 2nd Ed., Cold Spring 
Harbor, NY: Cold Spring Harbor Laboratory Press (1989)). The mutant GFPs 
produced by these methods may then be isolated by any of a number of protein 
purification techniques, such as chromatography (preferably aflBnity 
chromatography, HPLC or FPLC), salt extraction (such as ammonium sulfate 
precipitation), electrophoresis, dialysis, or a combination thereof, to produce 
isolated mutant GFPs of the invention. These mutant GFPs may then be stored 
until use (preferably at temperatures below C^C, more preferably at about -20'*C 
to about -70 "C), or they may be formulated into compositions. Preferred such 
compoations may comprise, for example, one or more of the mutant GFPs of the 
invention and one or more additional components, such as one or more buffer 
salts, one or more inorganic salts or ions thereof, one or more detergents, one or 
more preservatives, and the like, preferably in an aqueous or organic solvent. 

Detection Methods 

In additional embodiments, the invention relates to methods of detecting 
the presence of a mutant GFP, or of a cell (such as a prokaryotic or eukaryotic, 
including mammalian, cell) expressing a mutant GFP. Such methods of the 
invention may comprise, for example, illuminating the mutant GFP or cell 
expressing the mutant GFP with a source of white light under conditions such that 
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the mutant GFP or cell expressing the mutant GFP emits visible fluorescent light. 
In the present methods, the illumination source may be any light source emitting 
white visible) light, induing but not limited to an mcandescent light source, 
a fluorescent light source, a halogen light source, sunlight, and the like. When 
illuminated by such a white light source, mutant GFPs, such as those of the 
present invention, will emit fluorescent light of various visible wavelengths 
(depending upon the specific mutations contained in the mutant GFP, as described 
above), which may be detected by eye or by any of the above-described qualitative 
or quantitative mechanical means. 

Kits 

In other preferred embodiments, the compositions of the present invention 
may be assembled into kits for use in transfecting host cells with the nucleic acid 
molecules encoding the present mutant GFPs, or for labeling target polypeptides 
with the present mutant GFPs. Host cell transfection kits according to the present 
invention may comprise at least one container containing one or more of the 
above-described nucleic add molecules encoding a mutant GFP (or a composition 
comprising one or more of the nucleic acid molecules or plasmids described 
above), which nudeic add molecule preferably comprises plasmid pGreenLantem- 
2/Al or plasmid pGreenLantem-2/A4 (see Example 1 below). These transfection 
kits of the invention may optionally further comprise at least one additional 
container which may contain, for example, a reagent for delivering the mutant 
GFP nucleic add molecule into a host cell; in preferred kits, this reagent may 
comprise a liposome and most preferably LIPOFECTAMINE™. Polypeptide 
labeling kits according to the present invention may comprise at least one 
container containing, for example, a mutant GFP such as those described above 
(or a composition of the invention comprising a mutant GFP), which is preferably 
a mutant GFP having an amino acid sequence as set forth in SEQ ID NO: 5 or 
SEQ ID N0:6. These labeling kits of the invention may optionally further 
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comprise at least one additional container which may contain, for example, a 
reagent for covalently linking the mutant GFP to the target polypeptide. 

Use of Mutant GFPs 

The mutant GFPs and kits of the present invention may be used in a variety 
of applications. For example, the mutant GFP cDNAs are useful as reporter genes 
that allow a determination of transfection eflSciency and success (Chalfie, M., ei 
oL, Science 265:802-805 (1994)). Alternatively, the mutant proteins themselves 
may be used as fluorescent labels suitable for detectably labeling other proteins, 
nucleic acids or particulates to be used in a variety of applications (Heim, R., ei 
al,. Proa NatL Acad ScL USA 97:12501-12504 (1994); Yokoe, H., and Meyer, 
T., Nature Biotech. 7^:1252-1256 (1996)), such as labeling antibodies used in 
infectious disease diagnostic methods; mutant GFPs may be attached to target 
polypeptides and proteins by a variety of methods that are well-known to one of 
ordinary skill in the art, including the use of chemical coupling reagents. In 
addition, fusion complexes between GFP and other proteins may be constructed 
to allow closer and more sensitive determinations of the expression, biochemistry, 
localization and trafficking of intracellular proteins in many host cells (Heim, R., 
etaL^Proa NatL Acad. Sci. USA 97:12501-12504 (1994); Wang, S., and Tulle, 
H., Nature 569:400-403 (1994); Marshall, J., et aL, Neuron 14:1\ 1-215 (1995); 
Rizzuto, R., etaL, Curr. BioL 6:183-188 (1996)). Importantly, use of the mutant 
GFPs that emit fluorescence when illuminated by white light will spare the user 
considerable expense and technical difficulty that can accompany the use of 
fluorescent optics for the examination of fluorescent reporter genes such as GFP. 

It will be readily apparent to one of ordinary skill in the relevant arts that 
other suitable modifications and adaptations to the methods and applications 
described herein are obvious and may be made without departing from the scope 
of the invention or any embodiment thereof Having now described the present 
invention in detail, the same will be more clearly understood by reference to the 
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foUowing examples, which are included herewith for purposes of illustration only 
and are not intended to be limiting of the invention. 

Examples 

Example 1: Construction of Mutant GFP cDNAs 

Piasmids. As depicted in Figure 5, pGreenLantern-l (Life Technologies, 
Inc., Rockville, Maryland; catalogue no. 10642) contains the humanized S65T 
mutant GFP cDNA (Figure 2; SEQ ID N0s:3, 4) (Evans, K., et ai, FOCUS 
18(2). (1996); Zolotukhin, S., etal, 1 ViroL 70:4646-4654 (1996)). This 
plasmid serves as the source of the GFP DN A sequence used for mutagenesis. As 
depicted in Figure 6, pGreenLantem-2 contains a universal sequencing primer 
downstream of the CMV promoter along with an Nsil site immediately upstream 
of the CMV promoter allowing excision of the promoter region. It also contains 
Xbal, Xhol and HindDl sites in place of the 3' Noil site in pGreenLantem-l . A 
stop codon in the 5* multiple cloning site of pGreenLantem-1 was shifted out of 
frame to allow possible fusions to GFP in pGreenLaiitem-2. 

Mutations to GFP cDNA by UDG cloning. PCR was performed in an MJ 
Research DNA Engine™ thermal cycler using the following conditions: 94 *C for 
60 seconds, 94^*0 for 30 seconds, 55**C for 30 seconds and 72**C for 4 minutes, 
repeated for 20 cycles. Sense oligonucleotide primers containing specific 
mismatches to the wt-GFP sequence (SEQ ID NOs:7-15; Table 2) were obtained 
from Life Technologies, Inc. (Rockville, Maryland). 
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Table 2. Sense Oligonucleotides Used for UDG Cloning Mutations. 





Vector 


Amino 
Acid 
Mutations 


Single-stranded 
Oligonucleotide Sequence 
(5* to 3') 


SEQm 
NO: 




pGreenLantem- 
2/Al 


Cys64,Ala65 


CAACACUGGUCACUACCTG- 
CGCCTATGGCGTGC 


7 


5 


pGreenLantem- 

2/A2 


Cys64, Thr65 


CCAACACUGGUCACUACCT- 
GCACCTATGG 


S 


* 


pGreeoLantem- 

2/A3 


Leu64,Thr65 


CAACACUGGUCACUACCCT- 
CACCTATGGCGTGCAGT 


9 


10 


pGreenLanton- 
2/A4 


Met64,Ala65 


CAACACUGGUCACUACAAT- 
GGCCTATGGCGTGCAGTGCT 


10 




pGreenLantem- 
2/A5 


Met64, 
Thr65 


CAACACUGGUCACUACCAT- 
GACCTATGGCGTGCAGTGCT 


11 




pGreenLantem- 
2/A6 


Met64, 
Phe65, Phe66 


CAACACUGGUCACUACCAT- 
GTTCTTCGGCGTGCAGTGCT 


12 


15 


pGreenLantem- 
2/A7 


Met64, 
Phe65, Lys66 


CAACACUGGUCACUACCAT- 
GTTCAAGGGCGTGCAGTGCT 


13 




pGreeoLantem- 

2/A8 


Thr64, Cys65 


CAACACUGGUCACUACCAC- 
ATGCTATGGCGTGCAGT 


14 


20 


pGreenLantcm- 
2/A9 


Val64, Cys65 


CAACACUGGUCACUACCGT- 
GTGCTATGGCGTGCAGT 


15 



The antisense oligonucleotide primer used for each mutation set had the 
following sequence: 5*-AGU-GAC-CAG-UGU-UGG-CCA-AGG-CAC-AGG- 
GAG-CTT.3' (SEQ ID NO: 1 6). The template plasmid used was pGreenLantem- 1 
(Figure 5) with a universal reverse sequencing primer incorporated into the 

25 backbone. Amplifications reactions contained IX PGR buffer, 10 micromolar 

deoxynudeoside triphosphates, 25 picomoles of each primer (sense and antisense) 
and 10 nanograms of template DNA in a 50 microliter volume. After 6, 9 and 20 
PGR cycles were completed, 10 microliter samples were taken and checked via 
agarose gel electrophoresis for excess background. Two 20 microliter samples of 

30 each 6-cycle aliquot were digested with Dpnl at 37^*0 for 30 minutes, then at 
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TS^'C for 15 minutes and allowed to cool to room temperature. One of the 
samples from each reaction (four samples in all) was treated with one unit of uracil 
DNA glycosylase (UDG) at 37X for 30 minutes (Nisson, P., et al., PCRMeih. 
AppL 7:120-123 (1991)). PGR samples were then transformed into 100 
5 microliters of MAX Efficiency DHIOB™ Competent Cells (Life Technologies, 

Inc.; Rockville, Maryland). The mutated portion of the GFP cDNA was then 
subcloned with a Nod and BamYH digest into the pGreenLantem-2 backbone 
(Figure 6) which was not subjeaed to PCR (Sambrook, J., et al^ Molecular 
Cloning, a Laboratory Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring 
10 Harbor Laboratory Press (1989)). This approach yielded nine separate mutant 

GFP plasmid vectors, designated pGreenLantem-2/Al through pGreenLantem- 
2/A9 (Table 2), each with a specific mutation or set of mutations within the GFP 
chromophore region at amino acids 64-66. 

Example 2: Growth and Transfection of Host Cells With Mutant GFPs 
15 Cell Culture, Chinese hamster ovaiy cells (CHO-Kl, obtained from 

American Type Culture Collection (ATCC), Rockville, Maryland) were cultured 
in D-MEM (4,500 milligrams/liter D-glucose with L-glutamine and phenol red) 
plus 10% fetal bovine serum (FBS), 0.1 millimolar nonessential amino acids, 2.5 
units per milliliter penicillin and 2.5 micrograms per milliliter streptomycin 
20 (Freshney, R,L, Culture of Animal Cells: A Manual of Basic Techniques, 3rd Ed., 

New York: Wilqf-Liss (1994)). Cells were grown at 37**C in a 5% COj/air 
incubator. All media and reagents were from Life Technologies, Inc., Rockville, 
Maryland. 

Transfection. CHO-Kl cells were plated at 2 x 10^ cells per well into six- 
25 well (35 millimeter diameter) plates one day prior to transfection. Immediately 

before transfection, cells were rinsed with medium containing no serum or 
antibiotics. LIPOFECTAMINE™ reagent was diluted into 100 microliters of 
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OPTI-MEM-I Reduced Serum Medium (without FBS) to give a final 
concentration of LIPOFECTAMINE of 6 microliters per well. DNA was diluted 
sq)arately to a concentration of 1 mia-ogram per well in 100 microliters of OPTI- 
MEM-I. Transfection complexes were formed by combining diluted lipid and 
DNA and incubating for 30 minutes prior to addition to cells. Transfection 
complexes were then diluted 1:5 with D-MEM containing no FBS or antibiotics 
and added to the rinsed cells. Cells were transfected for five hours at 37''C, then 
fed with an equal volume of D-MEM containing 20% FBS, 0.1 millimolar 
nonessOTtial amino adds, and no antibiotics. Cells were grown overnight at SV'^C, 
5% COj/air. In some studies, cells were grown for 48 hours; in these studies, 
transfection complexes were removed fi'om cells 24 hours after addition and cells 
were fed with 2 milliliters per well of complete medium. 

Regardless of the vector used, host cells transfected with the mutant GFP 
genes demonstrated approximately equivalent growth rates as control cells 
transfected with the wildtype GFP gene or with other reporter genes (e.g. , P-gal). 
These results indicate that transfection with the mutant GFP cDNAs of the present 
invention does not adversely affect the growth or culturability of the host cells 
more than transfection with any other reporter vector. 

Example 3: Charactenzfftion of GFP Mutants Expressed in Eukaryotic Cells 

Formalin Fixation. Transfected host ceUs were rinsed in Dulbecco's 
Phosphate Buffered Saline (PBS), then fixed in a solution of 1 0% formalin in PBS 
for one hour. Formalin was then removed, and cells were rinsed and stored in 
PBS at 4**C until being analyzed. 

Fluorescence Microscopy. Formalin-fixed cells were examined and 
photographed using an inverted phase contrast fluorescence microscope equipped 
with FITC filters (excitation 475 nm/dichroic 485 nm/barrier 490 nm) and a 50 
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watt mercury arc bulb at 1 .25 volts. A 40X-power adjustable non-phase objective 
was used for all micrographs, which were taken through blue, neutral and FITC 
filters using Kodak Ektachrome ASA 400 Daylight (for slides) or Kodak Gold 
ASA 400 Daylight (for prints). All exposures were for 12 seconds to allow 
unbiased comparison of fluorescence intensity. 

Flow Cytofluorimetry. Flow cytofluorimetry was performed on 
transfected CHO-Kl cells that were trypsinized and suspended in PBS plus 10% 
formalin at a concentration of less than 1 0^ cells per milliliter. Measurements were 
made on a Coulter EPICS® XL-MCL flow cytometer using a 1 5 megawatt argon 
ion laser. Filters used were 488 nm excitation, 500 nm dichroic LP/525 nm band 
pass for FLl (green channel) and 575 band pass/600 nm dichroic LP for FL2 
(orange channel). Samples consisted of 20,000 events using PMT voltages of 100 
voks for side scatter and forward scatter, 496 volts for FLl and 505 volts for FL2, 
all with integral gain set to 1.0. Color compensation included 7.9% orange signal 
in FLl and 3.2% green signal in FL2. 

Results, As shown in Table 3, the GFP mutants of the present invention 
displayed varying intensities and kinetics of formation in transfected cells. Two 
of these mutants, designated "Al" (phenylalanine mutated to cysteine at position 
64; serine mutated to alanine at position 65; Figure 3; SEQ ID N0:5) and "A4" 
(phaiylalanine mutated to methionine at position 64; serine mutated to alanine at 
position 65; Figure 4; SEQ ID N0:6) were exceptionally bright. As shown in 
Figures 7-9, CHO ceUs transfected with plasmid pGreenLantem.2/Al (Figure 7) 
or with plasmid pGreenLantem-2/A4 (Figure 8) demonstrated a dramatic increase 
in green fluorescence intensity over cells transfected with the humanized S65T 
mutation of pGreenLantem-1 (Figure 9) when viewed at 24 hours post- 
transfection using FITC optics. 
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Table 3. EfTects of Point Mutations on GFP Fluorescence Intensity. 



Vector 


Amino Acids 


Fluorescence Results 


Wildtype GFP 


Phe64, Ser65 


A«=395 nm (major), 470 nm (minor); 48 
hours required for detection 


S65T 


Phe64, Thr65 


6-fold increase in intensity over wildtype 


pureeni^Entem- 1 


1^11604, inroD 

(humanized) 


22-fold increase in intensity over wildtype 


pGreenLantem- 
2/Al 


Cyso4, Alao5 


6-fold increase in intensity over S65T 


pGreenLantern- 
2/A2 


Cyso4, inroj 


22-fold increase in intensity over wildtype 


pureenbantern- 
2/A3 


Leuo4, inroj 


6-fold increase in intensity over S65T 


pGreenLantem- 
2/A4 


Meto4, AJaoj 


6-fold increase in intensity over S65T 


pGreenLantern- 
2/A5 


Meto4, inroD 


Slight increase in intensity over 
pGreenLantern- 1 


poiGenjuanicTTi- 
2/A6 


JViei04, i^neOD, 
Phe66 


Equivalent to wildtype 


pGreenLantem- 

2/A7 


Met64, Phe65, 
Lys66 


Equivalent to wildtype 


pGreenLantem- 

2/A8 


Thr64, Cys65 


Equivalent to wildtype 


pGreenLantern- 
2/A9 


Val64, Cys65 


Slight increase in intensity over 
oGreenLantem-1 



Other mutants produced in the present studies were less satisfactory 
25 (Table 3). For example, mutants A5 (phenylalanine mutated to methionine at 

position 64; serine mutated to threonine at position 65) and A9 (phenylalanine 
mutated to valine at position 64; serine mutated to cysteine at position 65) gave 
only slightly better fluorescence than the humanized S65T mutation of 
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pGreenLantem-1 . It is possible that the highly reactive cysteine at position 65 in 
mutant A9 may interfere with the formation of the three amino acid heterocyclic 
ring required for GFP fluorescence (Cody, C.W., Biochemistry J2: 1212-1218 
(1993)). 

5 Mutant A2 (phenylalanine mutated to cysteine at position 64; serine 

mutated to threonine at position 65) was equal in fluorescence to the humanized 
S65T pGreenLantem-1 (Evans, K., et al, FOCUS J8(2)A0'43 (1996); 
Zolotukhin, S., et al, J. Virol 70:4646-4654 (1996)), whUe mutants A6 
(phenylalanine mutated to methionine at position 64; serine mutated to 

10 phenylalanine at position 65; ^osine mutated to phenylalanine at position 66), A7 

(phenylalanine mutated to methionine at position 64; serine mutated to 
phenylalanine at position 65; tyrosine mutated to lysine at position 66) and AS 
(phenylalanine mutated to threonine at position 64; serine mutated to cysteine at 
position 65) demonstrated a decreased fluorescence intensity and were, in fact, 

15 equivalent to wt-GFP. No shift in excitation or emission spectra was detected 

with these three mutants, however, as no fluorescence was observed using 
ultraviolet or rhodamine filter combinations. 

These results wa-e also observed via flow cytometry. As shown in Figure 
10, CHO-Kl cells transfected with the Al and A4 mutant GFPs demonstrated a 

20. dramatic increase in fluorescence over wildtype and A6-A8 mutants within 24 

hours of transfection. This high level of fluorescence was maintained, particularly 
for cells transfected with the A4 mutant GFP, for at least 48 hours after 
transfection (Figure 1 1). 

Mutations at certain amino acid positions outside the chromophore were 

25 also examined for their effects on GFP fluorescence. Mutation of Gln69-> Asn in 

the A4 mutant resulted in a dramatic decrease in fluorescence relative to the A4 
mutant itself as did mutation of Vall63-*Ala and Ilel67-^Thr in the A4 mutant. 

Together, these results indicate that the most preferable mutations for 
providing highly fluorescent, rapidly expressed GFPs are those in which only one 
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reactive amino acid is present at either position 64 or 65, as in the Al 
(Phe64-»Cys; Ser65-»Ala; SEQ ID N0:5) and A4 (Phe64-»Met; Ser65-»AJa; 
SEQIDN0:6) mutants. 

Example 4: Characterization of GFP Mutants Expressed in Prokaryotic Cells 

To examine the eflBcacy of expressing mutant GFPs in prokaryotic cells, 
mutant GFP cDNAs were subcloned into the bacterial pProEX HTb vector 
(Figure 12). GFP cDNA was excised by Notl and Xbal digestion from 
pGreenLantem-2 (Figure 6) containing the mutations at positions 64, 65 and/or 
66 (mutants Al through A9) shown in Table 3. The bacterial veaor pProEX HTb 
(Figure 12) was also digested with the same enzymes. The pProEX HTb 
backbone and GFP fragments were ligated, to form the corresponding transfection 
vectors containing the respective mutant GFP fragments: pProEXAl , pProEXA2, 
pProEXA3, pProEXA4, pProEXA5, pProEXA6, pProEXA?, pProEXAS and 
pProEXA9. These vectors were then individually transformed into 100 ^] of 
DHIOB E, coli host cells; control cells were also prepared that had been 
transfected with a construct containing the S65T mutant described in Examples 
1-3 above. Cells were plated onto ampicillin/IPTG plates and incubated overnight 
at 37*'C, and colonies were then picked and screened for fluorescence under long 
ultraviolet (UV) or blue illumination. 

Colonies containing the Al, A2, A3, A4, A5, A9 and S65T mutant GFPs 
all demonstrated greoi fluorescence when illuminated with long UV or blue light, 
while those containing the A6, A7 and A8 mutant GFPs demonstrated no 
fluorescence under these conditions. These results are consistent with those 
observed in eukaryotic cells, as shown in Example 3 above, and indicate that 
mutant GFPs may be successfully transfected into and expressed in prokaryotic 
cells. 
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Example 5: Visible Light Excitation of GFP Mutants 

To examine the ability of mutant GFPs to emit fluorescence when 
illuminated by white light, £. coli cells were transfected and plated as described 
above in Example 4. Colonies were then picked and examined for fluorescence 
5 upon illumination by incandescent light, fluorescent indoor lighting, or sunlight. 

Upon induction of the host cells with IPTG, cells transformed with the 
vector comprising the A4 GFP mutation unexpectedly exhibited bright green light 
emission under normal daylight conditions, without the need for excitation wth 
UV light. Similar results were observed for cells transformed with the A3 mutant 

10 GFP. Cells containing the Al and A5 mutant GFPs were also seen to be less (but 

still observably) fluorescent under white light illumination. Conversely, only very 
weak emission of light was observed under white light illumination in the cells 
transformed with the vectors comprising only the S65T, A2 and A9 mutations. 
Cells comprising the A6, A7 and A8 mutations exhibited no fluorescence when 

15 illuminated by white light. 

When plates containing these mutants were stored in the dark at 4°C for 
38 days, however, all of the colonies except those containing the A6, A7 or AS 
mutant GFPs were seen to be more intensely fluorescent under white light 
illumination. Colonies containing the A3, A4 and A5 mutants were more 

20 fluorescait under these conditions than were those containing the Al , A2, A9 and 

S65T mutants, although all colonies fluoresced more brightly than they did in 
freshly plated cells (i.e., when observed withm 24-48 hours of transfection). 
When these plates were allowed to warm to room temperature, the fluorescence 
in colonies containing the Al, A2, A9 and S65T mutants decreased, while that in 

25 colonies containing the A3, A4 and A5 mutants remained brightly fluorescent. 

It is possible that the increased fluorescence observed in stored plates may 
have been due to accumulation of mutant protein in the cells over time in storage, 
indicating a dependence of white light fluorescence upon intracellular 
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concentration of the GFP. To test this notion, a 6His-tagged A4 GFP construct 
prepared and isolated by metal affinity chromatography according to standard 
techniques {see Ausubd, EM., et al.^ in Current Protocols in Molecular Biology, 
New York: John Wiley & Sons, Inc., pp. 10.11.10-10.11.24 (1996)), was 
examined for fluorescence under blue, red and white light at various protein 
concentrations in solution. At a concentration of about 1.5 ng/ml, the purified A4 
GFP was brightly fluorescent under sunlight and fluorescent indoor white lighting, 
as well as under blue light; no fluorescence was observed, however, under red 
light. This highly concentrated A4 GFP solution became nonfluorescent upon 
boiling, but was at least slightly fluorescent up to a temperature of about 82 °C. 
When diluted to 0. 1 ^ig/ml, however, the A4 GFP solution fluoresced brightly 
under blue light (closer in wavelength to the excitation maximum of GFP which 
is in the UV range), but did not fluoresce under white light illumination. These 
results suggest that the increased fluorescence observed upon white light 
illumination of colonies stored for extended periods of time may be due to 
accumulation of GFP protein in the cells, 

Tdken together, these results indicate that prokaryoric cells containing the 
A3 or A4 mutant GFPs, and to a lesser extent the Al and A5 mutant GFPs, can 
emit light without the addition of an exogenous substrate or the use of ultraviolet 
irradiation. Use of these GFP constructs thus provides advantages over other 
visible light reporter vectors which require the use of exogenous substrates, and 
over other fluorescent reporter vectors which require UV irradiation which may 
induce undesirable mutations in the host cells. 

Example 6: Additional GFP Mutations ^ 

To examine the effeas of alternative point mutations on GFP fluorescence, 
mutations are targeted at the tryptophan residue at position 67 (the only 
tiyptophan residue in the entire GFP molecule which is located in the unique motif 
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Pro-Val-Pro-Trp-Pro (SEQ ID NO: 17)). To accomplish this mutation, 
oligonucleotides are designed to mutate Trp57-*His or Trp57--»Tyr, in 
conjunction wth the Ser65-»Thr mutant (SEQ ID N0:4) or the Phe64-»Met; 
Ser6S-*Ala mutant (SEQ ID N0:6). These mutants are made in the bacterial 
5 vector pProEX HTb as described in Example 4, using specific oligonucleotides 

designed to provide the desired mutations. The vector constructs are then 
transfected into host cells and characterized as above for their fluorescence. 

In a similar fashion, mutations are made at other amino acid positions 
outside of the GFP chromophore region. For example, mutations are made at 
10 Arg96, which is probably responsible for stabilizing resonance structures of the 

imidazolidone 5-membered ring during ring formation and possibly during 
excitation, and is therefore a target for more rapid ring formation and, hence, 
faster detection of fluorescence. Mutations involving this residue include 
Arg96->His. 

15 Mutations are also possible at Phe46, which along with Phe64 separates 

the 5-membered chromophore ring fi-om direct contact with the single tryptophan 
in the Ser65->Thr GFP (SEQ ID N0:4). By allowing direct hydrogen bonding 
between Trp57 and the ring structure, efficient energy transfer is possible as with 
the Phe64-»Leu; Ser64-*Thr mutant. Mutations involving this residue include 

20 Phe46-^Leu or other hydrophobic residues that promote hydrogen bondmg. 

Mutations are also made at Leu221 and Phe223, which are involved in 
dimer formation. Only three hydrophobic residues are in the dimer contact region; 
all others are hydrophobic. By mutating Leu221 and/or Phe223 to a hydrophilic 
or "neutral" residue such as glycine, GFP aggregation, which can be a problem 

25 with GFP fusion constructs, may be inhibited. 

Mutations are also made at His 148, which probably stabilizes the 
fluorophore and forms hydrogen bonds with Tyr66 and Gln94. Mutations of 
Hisl48 to a residue with a different charge or a different pKa are made to allow 
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alteration of the excitation and emission spectra of GFP, similar to results seen 
with Tyr66-»His which results in blue fluorescence by GFP. 

Finally, mutations introducing a second S-membered ring structure into 
the a-helix of GFP are made, to allow increased fluorescence intensity of the 
resultant GFP. 

Having now fiiUy described the present invention in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious 
to one of ordinary skill in the art that the same can be performed by modifying or 
changing the invention within a wide and equivalent range of conditions, 
formulations and other parameters without aflfecting the scope of the invention or 
any specific embodiment thereof, and that such modifications or changes are 
intended to be encompassed within the scope of the appended claims. 

All publications, patents and patent apphcations mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same extent 
as if each individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Life Technologies, Inc. 

(B) STREET: 9800 Medical Center Drive 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 20850 

(ii) TITLE OF INVENTION: Mutants of Green Fluorescent Protein 
(iii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC COitpatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (To be assigned) 

(B) FILING DATE: 17 -NOV- 1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US (To be assigned) 

(B) FILING DATE: 14 -NOV- 1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/030,935 

(B) PILING DATE: 15 -NOV- 1996 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: gfplO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..714 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG AGC AAG GGC GAG GAA CTG TTC ACT GGC GTG GTC CCA ATT CTC GTG 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
15 10 15 

GAA CTG GAT GGC GAT GTG AAT GGG CAC AAA TTT TCT GTC AGC GGA GAG 96 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

GGT GAA GGT GAT GCC ACA TAC GGA AAG CTC ACC CTG AAA TTC ATC TGC 144 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

ACC ACT GGA AAG CTC CCT GTG CCA TGG CCA ACA CTG GTC ACT ACC TTC 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

ACC TAT GGC GTG CAG TGC TTT TCC AGA TAC CCA GAC CAT ATG AAG CAG 240 
Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Mfet Lys Gin 
65 70 75 80 

CAT GAC TTT TTC AAG AGC GCC ATG CCC GAG GGC TAT GTG CAG GAG AGA 288 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

ACC ATC TTT TTC AAA GAT GAC GGG AAC TAC AAG ACC CGC GCT GAA GTC 336 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

AAG TTC GAA GGT GAC ACC CTG GTG AAT AGA ATC GAG TTG AAG GGC ATT 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 

GAC TTT AAG GAA GAT GGA AAC ATT CTC GGC CAC AAG CTG GAA TAC AAC 432 
Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

TAT AAC TCC CAC AAT GTG TAC ATC ATG GCC GAC AAG CAA AAG AAT GGC 480 
Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

ATC AAG GTC AAC TTC AAG ATC AGA CAC AAC ATT GAG GAT GGA TCC GTG 528 
He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

CAG CTG GCC GAC CAT TAT CAA CAG AAC ACT CCA ATC GGC GAC GGC CCT 576 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

GTG CTC CTC CCA GAC AAC CAT TAC CTG TCC ACC CAG TCT GCC CTG TCT 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 
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AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTG CTG GAG TTT GTG 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

ACC GCT GCT GGG ATC ACA CAT GGC ATG GAC GAG CTG TAG AAG 714 
Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

TGA 717 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 



Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 



Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 



Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 
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He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 

225 230 235 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLBCOLE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: gf p (h) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: l.,714 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
240 245 250 



GAA TTA GAT GGT GAT GTT AAT GGG CAC 
Glu Leu Asp Gly Asp Val Asn Gly His 
255 260 

GGT GAA GGT GAT 6CA ACA TAC GGA AAA 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
275 

ACT ACT GGA AAA CTA CCT GTT CCA TGG 
Thr Thr Gly Lys Leu Pro Val Pro Trp 
290 295 



AAA TTT TCT GTC AGT GGA GAG 96 
Lys Phe Ser Val Ser Gly Glu 
265 270 

CTT ACC CTT AAA TTT ATT TGC 144 

Leu Thr Leu Lys Phe He Cys 
280 285 

CCA ACA CTT GTC ACT ACT TTC 192 
Pro Thr Leu Val Thr Thr Phe 
300 
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TCT TAT GGT GTT CAA TGC TTT TCA AGA TAG CCA GAT CAT AT6 AAA CAG 240 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
305 310 315 

CAT GAG TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA 2B8 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
320 325 330 

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAG AAG ACA CGT GCT GAA GTC 336 
Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
335 340 345 350 

AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
355 360 365 

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA TTG GAA TAG AAC 432 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
370 375 380 

TAT AAC TCA CAC AAT GTA TAG ATC ATG GGA GAC AAA CAA AAG AAT GGA 480 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
385 390 395 

ATC AAA GTT AAC TTG AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT 528 
He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
400 405 410 

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGG CCT 576 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
415 420 425 430 

GTC CTT TTA CCA GAC AAC CAT TAG CTG TGC ACA CAA TCT GCC CTT TCG 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
435 440 445 

AAA GAT CCC AAC GAA AAG AGA GAC GAG ATG GTC CTT CTT GAG TTT GTA 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
450 455 460 

ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAG AAA 714 
Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
465 470 475 



TAA 



717 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TyPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 



Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 



Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acide 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : not relevant 

(D) TGPOLCXSY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Cys 
50 55 60 

Ala Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 



Lys Asp 



Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 



Phe Val 
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210 215 220 

Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys 

225 230 235 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECUI*E TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Met 
50 55 60 

Ala Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Mfet Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 ' 125 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 
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Gin 



Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
180 185 190 



Val 



Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 



Lys 



Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 



Thr 
225 



Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys 
230 235 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAACACUGGU CACUACCTGC GCCTATGGCG TGC 33 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCAACACUGG UCACUACCTG CACCTATGG 29 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 
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(ii) MOLECULE TYPE: CDNA 



(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 9: 
CAACACUGGD CACUACCCTC ACCTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHT^CTERISTICS : 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAACACUGGU CACUACAATG GCCTAT6GCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CAACACUGGU CACOACCATG ACCTATGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID £10:12: 
CAACACUGGD CACDACCATG TTCTTCGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CAACACUGGU CACUACCATG TTCAAGGGCG TGCAGTGCT 39 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAACACUGGU CACUACCACA T6CTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CAACACUGGD CACOACCGTG TGCTATGGCG TGCAGT 36 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGUGACCAGU GUUGGCCAAG GCACAGGGAG CTT 33 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Pro Val Pro Trp Pro 
1 5 
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WHAT IS CLAIMED IS: 

1 . A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Fluorescent Protein having an amino acid sequence 
comprising an amino acid residue lacking an aromatic ring structure at position 64 
and an amino acid residue having a side chain no longer than two carbon units in 
length at position 65, with the provisos that 

if said residue at position 64 is leucine then said residue at position 65 is 
not cysteine or threonine; 

if said residue at position 64 is valine then said residue at position 65 is not 

alanine; 

if said residue at position 64 is methionine then said residue at position 65 
is not glycine; and 

if said residue at position 64 is glycine then said residue at position 65 is 
not cysteine. 

15 2. The nucleic acid molecule of claim 1, wherein said amino acid 

residue at position 64 is selected from the group consisting of alanine, valine, 
leucine, isoleucine, proline, methionine, glycine, serine, threonine, cysteine, 
alanine, asparagine, glutamine, aspartic acid and glutamic acid. 

3. The nucleic acid molecule of claim 1, wherein said amino acid 
20 residue at position 64 is cysteine or methionine. 

4. The nucleic acid molecule of claim 3, wherein said amino acid 
residue at position 65 is alanine. 



5 



10 
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5. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 65 is selected from the group consisting of alanine, glycine, 
threonine, cysteine, asparagine and aspartic acid. 

6. The nucleic acid molecule of claim 1, wherein said amino acid 
residue at position 65 is alanine. 

7. The nucleic acid molecule of claim 6, wherein said amino acid 
residue at position 64 is cysteine or methionine, 

8. A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Euorescent Protein having an amino acid sequence as 
setforthinSEQIDNO:5. 

9. A nucleic acid molecule encoding a mutant Green Fluorescent 
Protein, said mutant Green Fluorescent Protein having an amino acid sequence as 
set forth in SEQ ID N0:6. 

10. A mutant Green Fluorescent Protein havmg an amino acid 
sequence comprising an amino acid residue lacking an aromatic ring structure at 
position 64 and an amino acid residue having a side chain no longer than two 
carbon atoms in length at position 65, vnth the provisos that 

(a) if said residue at position 64 is leucine then said residue at position 65 
is not cysteine or threonine; 

(b) if said residue at position 64 is valine then said residue at position 65 
is not alanine; 

(c) if said residue at position 64 is methionine then said residue at position 
65 is not glycine; and 
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(d) if said residue at position 64 is glycine then said residue at position 65 
is not cysteine. 

11. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 64 is selected from the group consisting of alanine, 

5 valine, leudne, isol^dne, proline, methionine, glycine, serine, threonine, cysteine, 

alanine, asparagine, glutamine, aspartic acid and glutamic acid. 

12. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 64 is cysteine or methionine. 



13. The mutant Green Fluorescent Protein of claim 12, wherein said 
10 amino acid residue at position 65 is alanine. 

14. The mutant Green Fluorescent Protein of claim 10, wherein said 
amino acid residue at position 65 is selected from the group consisting of alanine, 
glycine, threonine, cysteine, asparagine and aspartic acid. 

15. The mutant Green Fluorescent Protein of claim 1 0, wherein said 
15 amino acid residue at position 65 is alanine. 

16. The mutant Green Fluorescent Protein of claim 15, wherein said 
amino acid residue at position 64 is cysteine or methionine. 

17. A mutant Green Fluorescent Protein having an amino acid 
sequence as set forth in SEQ ID N0:5. 



20 18. A mutant Green Fluorescent Protein having an amino acid 

sequence as set forth in SEQ ID N0:6. 
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19. A host cell comprising the nucleic acid molecule of claim 1 . 

20. A vector comprising the nucleic acid molecule of claim 1 . 

2 1 . The vector of claim 20, wherein said vector is an expression vector. 

22. A host cell comprising the vector of claim 20. 

23. A method for producing a mutant Green Fluorescent Protein, 
comprising culturing the host cell of claim 19 or claim 22 under conditions 
favoring the production of a mutant Green Fluorescent Protein, and isolating said 
mutant Green Fluorescent Protein from said host cell. 

24. A mutant Green Fluorescent Protein produced by the method of 
claim 23. 

25. The mutant Green Huorescent Protein of any one of claims 10, 17, 
18 or 24, wherein said mutant Green Fluorescent Protein emits fluorescent light 
when illuminated by white light. 

26. A composition comprising plasmid pGreenLantem-2/Al . 

27. A composition comprising plasmid pGreenLantem-2/A4. 

28. A composition comprising the mutant Green Fluorescent Protein 
of any one of claims 10, 17, 18 or 24. 

29. A composition comprising the mutant Green Fluorescent Protein 
of claim 25. 
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30. A kit for transfecting a host cell with a nucleic acid molecule 
encoding a mutant Green Fluorescent Protein, said kit comprising at least one 
container containing the nucleic acid molecule of claim 1 . 

3 1 . The kit of claim 30, wherein said nucleic acid molecule comprises 
plasmid pGreenLantem-2/Al or plasmid pGreenLantem-2/A4. 

32. The kit of claim 30, further comprising at least one additional 
container containing a reagent for delivering said nucleic acid molecule into a host 
cell. 

33 . The kit of claim 32, wherein said reagent for delivering said nucleic 
acid molecule into a host cell comprises a liposome. 

34. A kit for labeling a polypeptide with a mutant Green Fluorescent 
Protein, said kit comprising at least one container containing the mutant Green 
Fluorescent Protein of any one of claims 10, 17, 18 or 24. 

35. The kit of claim 34, wherein said mutant GFP fluoresces when 
illuminated by white light. 

36. The kit of claim 34, further comprising at least one additional 
container containing a reagent for covalently attaching said mutant Green 
Fluorescent Protein to a polypeptide. 

37. A method of detecting the presence of a mutant GFP comprising 
illuminating the mutant GFP with a source of white light under conditions such 
that the mutant GFP emits visible fluorescent light. 
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38. A method of detecting the presence of a cell expressing a mutant 
GFP comprising illuminating the cell with a source of white light under conditions 
such that the mutant GFP expressed by the cell emits visible fluorescent light. 
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ATG AGT AAA GGA GAA GAA CU TJC ACT 6GA GH GTC CCA AH CTJ GU 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
15 10 15 

GAA HA GAT GGT GAT GTT AAT GGG CAC AAA TH TCT GTC AGT GGA GAG 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

GGT GAA GGT GAT GCA ACA TAG GGA AAA CU ACC CTT AAA TTT AH TGC 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 

ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTJ GTC ACT ACT HC 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

TCT TAT GGT GU CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CAG 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

CAT GAC TTT no AAG AGT GCC ATG CCC GAA GGT TAT 6TA CAG GAA AGA 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

AAG TTT GAA GGT GAT ACC CU GTT AAT AGA ATC GAG TTA AAA GGT AH 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA HG GAA TAC AAC 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

ATC AAA GH AAC TTC AAA AH AGA CAC AAC ATT GAA GAT GGA AGC GTT 
He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

GTC Cn HA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG 
Val Leu Leu Pro Asp /^n His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TH" GTA 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

ACA GCT GCT GGG AH ACA CAT GGC ATG GAT GAA CTA TAC AAA TAA 
Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys * 
225 230 235 

(SEQ ID NOs:l. 2} 

FIG.1 

SUBSTITUTE SHEET (RULE 26) 
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ATG 
Met 


AGT 
Ser 
240 


AAA 
Lys 


GGA 
Gly 


GAA 
Glu 


GAA 
Glu 


cn 

Leu 
245 


nc 

Phe 


ACT 
Thr 


GGA 
Gly 


GH 
Val 


GTC 
Val 
250 


CCA 
Pro 


An 

He 


cn 

Leu 


Gn 

Val 


48 


GAA 
G1u 
255 


HA 
Leu 


GAT 
Asp 


GGT 
Gly 


GAT 
Asp 


GH 
Val 
260 


AAT 
Asn 


GGG 
Gly 


CAC 
His 


AAA 
Lys 


m 

Phe 
265 


TCT 
Ser 


GTC 
Val 


AGT 
Ser 


GGA 
Gly 


GAG 
Glu 
270 


96 


6GT 
Gly 


GAA 
G1u 


GGT 
Gly 


GAT 
Asp 


GCA 
Ala 
275 


ACA 
Thr 


TAC 
Tyr 


GGA 
Gly 


AAA 
Lys 


cn 

Leu 
280 


ACC 
Thr 


CTT 
Leu 


AAA 
Lys 


TTT 
Phe 


An 

He 
285 


TGC 
Cys 


144 


ACT 
Thr 


ACT 
Thr 


GGA 
Gly 


AAA 
290 


CTA 
Leu 


CCT 
Pro 


GTT 
Val 


CCA 
Pro 


TGG 
Trp 
295 


CCA 
Pro 


ACA 
Thr 


CTT 
Leu 


GTC 
Val 


ACT 
Thr 
300 


ACT 
Thr 


nc 

Phe 


192 


TCT 
Ser 


TAT 
Tyr 


GGT 
305 


GU 
Val 


CAA 
Gin 


TGC 
Cys 


TTT 
Phe 


TCA 
Ser 
310 


AGA 
Arg 


TAC 
Tyr 


CCA 
Pro 


GAT 
Asp 


CAT 
His 
315 


ATG 
Met 


AAA 
Lys 


CAG 
Gin 


240 


CAT 
His 


GAC 
Asp 
320 


nr 

Phe 


nc 

Phe 


AAG 
Lys 


AGT 
Ser 


GCC 
Ala 
325 


ATG 
Met 


CCC 
Pro 


GAA 
Glu 


GGT 
Gly 


TAT 
Tyr 
330 


GTA 
Val 


CAG 
Gin 


GAA 
Glu 


AGA 
Arg 


288 


ACT 
Thr 
335 


ATA 
He 


m 

Phe 


TTC 
Phe 


AAA 
Lys 


GAT 
Asp 
340 


GAC 
Asp 


GGG 
Gly 


AAC 
Asn 


TAC 
Tyr 


AAG 

Lys 

345 


ACA 
Thr 


CGT 
Arg 


GCT 
Ala 


GAA 
Glu 


GTC 
Val 
350 


336 


AAG 
Lys 


TTT 
Phe 


GAA 
Glu 


GGT 
Gly 


GAT 
Asp 
355 


ACC 
Thr 


cn 

Leu 


m 

Val 


AAT 
Asn 


AGA 
Arg 
360 


ATC 
He 


GAG 
Glu 


nA 

Leu 


AAA 
Lys 


GGT 
Gly 

365 


An 

He 


384 


GAT 
Asp 


TTT 
Phe 


AAA 
Lys 


GAA 
Glu 
370 


GAT 
Asp 


GGA 
Gly 


AAC 
Asn 




CTT 
Leu 
375 


GGA 
Gly 


CAC 
His 


AAA 
Lys 


nG 

Leu 


^ 
Glu 

380 


TAC 
Tyr 


AAC 
Asn 


432 


TAT 
Tyr 


AAC 
Asn 


TCA 
Ser 
385 


CAC 
His 


AAT 
Asn 


GTA 
Val 


TAC 
Tyr 


ATC 
He 
390 


ATG 
Met 


GCA 
Ala 


GAC 
Asp 


AAA 

Lys 


CAA 
Gin 

395 


AAG 
Lys 


AAT 
Asn 


GGA 
Gly 


480 


ATC 
lie 


AAA 
Lys 

m 


GU 
Val 


AAC 
Asn 


TTC 
Phe 


AAA 
Lys 


ATT 
He 
405 


AGA 
Arg 


CAC 
His 


AAC 
Asn 


AH 
He 


GAA 
Glu 
410 


GAT 
Asp 


GGA 
Gly 


AGC 
Ser 


Gn 

Val 


528 


CAA 
G1n 
415 


CTA 
Leu 


GCA 
Ala 


GAC 
Asp 


CAT 
His 


TAT 
Tyr 
420 


CAA 
Gin 


CAA 
Gin 


AAT 
Asn 


ACT 
Thr 


CCA 
Pro 
425 


ATT 
He 


GGC 
Gly 


GAT 
Asp 


GGC 
Gly 


CCT 
Pro 
430 


576 


GTC 
Val 


CTT 
Leu 


TTA 
Leu 


CCA 
Pro 


GAC 
ASD 
435 


AAC 
Asn 


CAT 
His 


TAC 
Tyr 


CTG 
Leu 


TCC 
Ser 
440 


ACA 
Thr 


CAA 
Gin 


TCT 
Ser 


GCC 
Ala 


cn 

Leu 
445 


TCG 
Ser 


624 


AAA 


GAT 


CCC 
Prn 


AAC 
450 


GAA 


AAG 


AGA 

Am 


GAC 


CAC 
His 
455 


ATG 
Met 


GTC 
Val 


cn 

1 Ptl 


cn 

1 PI J 


GAG 

fill! 

VJ 1 LI 

460 


m 

r i ic 


GTA 
Val 


672 


ACA 
Thr 


GCT 
Ala 


GCT 
Ala 
465 


GGG 
Gly 


ATT 
He 


ACA 
Thr 


CAT 
His 


GGC 
Gly 
470 


ATG 
Met 


GAT 
Asp 


GAA 
Glu 


CTA 
Leu 


TAC 
Tyr 
475 


AAA 
Lys 






714 



TAA 717 

(SEQ ID N0s:3. 4) 



FIG.2 



SUBSTITUTE SHEET (RULE 26) 



wo 98/21355 PCT/US97/21662 

3/12 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Cys 
50 55 60 

Ala Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 
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Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Met 
50 55 60 

Ala Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 
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